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LIMIT SHAPES OF BUMPING ROUTES 
IN THE ROBINSON-SCHENSTED CORRESPONDENCE 

DAN ROMIK AND PIOTR SNIADY 



^D Abstract. We prove a limit shape theorem describing the as- 



ymptotic shape of bumping routes when the Robinson- Schensted 
algorithm is applied to a finite sequence of i.i.d. f/[0, 1] random vari- 
ables followed by an insertion of a deterministic number a. The 
bumping route converges after scaling, in the limit as the length of 
^^ the sequence tends to infinity, to an explicit, deterministic curve 

Cn depending only on a. This extends our previous result on the 

asymptotic determinism of Robinson-Schensted insertion. 

o 
u 

c3 1. Introduction 



Let &n denote the symmetric group of order n. Recall that the 
Robinson-Schensted correspondence associates with a permuta- 
tion a G &n a pair of standard Young tableaux (P, Q) whose common 



0^ shape A is a Young diagram of order n. A well-known area of study 

^ concerns asymptotic properties of the Robinson-Schensted shape A and 

(^ the tableaux P, Q associated with a permutation (T„ chosen uniformly 

^ at random from (5„. The study of such problems is motivated by di- 

O verse connections with the so-called Ulam- Hammer sley problem on the 

l_2 typical length of a longest increasing subsequence in random permu- 

r| tations, the representation theory of ©„, interacting particle systems, 

• ^ random growth models and random matrix theory |Roml3j . 

<S In this paper we continue this line of investigation by studying the 

C^ bumping route computed during the application of an insertion pro- 

cedure, which is the fundamental building block of the Robinson-Schen- 
sted correspondence. Let us recall the relevant definitions. If A is a 
Young diagram of order n and Xi, . . . ,Xn are distinct real numbers, an 
increasing tableau of shape A and entries given by Xi, . . . ,Xn is a 
filling of the boxes of A with the numbers Xi, . . . ,Xn that is increasing 
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Figure 1. Inserting a number into a tableau results in a 
cascade of bumping events. The bumping route is the 
sequence of positions where a bumping occurred. 



along rows and columns. Given such a tableau P and a number z dis- 



tinct from Xi, 



I Xr. 



the insertion procedure applied to P and z 



produces a new increasing tableau P' with entries xi, . . . , Xn, z whose 
shape A' is obtained from A by the addition of a single box. The new 
tableau P' is computed by performing a succession of bumping steps 
whereby z is inserted into the first row of the diagram, bumping an 
existing entry from the first row into the second row, which results in 
an entry of the second row being bumped to the third row, and so on, 
until finally the entry being bumped settles down in an unoccupied 
position outside the diagram A. An example is shown in Figure [l] We 
assume that the reader is familiar with the details, which are described 
in several well-known sources such as |Ful97t IKnu98| ISta99] . 

Define the bumping route Bpz associated with an insertion proce- 
dure performed on the tableau P with a new input z to be the sequence 
of positions where a bumping occurred during the insertion, together 
with the position of the final box added to the shape. The jth po- 
sition in the bumping route is of the form {bpz{j),j), so it is conve- 
nient to encode the bumping route using only the x-coordinates of the 
positions, which form a monotone nonincreasing sequence of positive 
integers 6p,^(l) > fep,2(2) > . . . > &p,^(fcp,2) whose length we denote 
by kp^z- For example, the bumping route associated with the insertion 
step in Figure |l| is (4,3,2,2,2). 

If an is a uniformly random permutation of order n, the bumping 
route computed in the last insertion step is Bp„_j^^anin), where Pn-i 
denotes the insertion tableau computed from the truncated sequence 
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(cr„(l), . . . , o"n(n — 1)). The question we wish to address is that of 
understanding the asymptotic behavior of this bumping route. 

Since the computation only depends on the relative order of the num- 
bers o"n(j), 1 < j < n, it will be equivalent, and more convenient, to 
formulate the result in terms of a sequence Xi , . . . , X„ of independent 
and identically distributed random variables with the ^[0, 1] distribu- 
tion, which gives a canonical way of realizing uniformly random order 
structures of all orders n simultaneously. If we denote by T„ the in- 
sertion tableau associated with the sequence of numbers (Xi, . . . , X„), 
then the bumping route BTn-i,x„ is equal in distribution to Bp^_-^_o-„(n)- 

Our result will pertain to an even more general scenario in which the 
final input X„ is taken to be an arbitrary (non-random) number a in 
[0, 1]. Note that, by obvious monotonicity properties of the insertion 
procedure, as a increases from to 1, the bumping route is deformed 
monotonically between the two extreme cases a = and a = 1, where 
in the case a = the bumping route will be the first column of the 
diagram and an additional new box at the top of the first column, and 
in the case a = 1 the bumping route consists of a single new box at 
the end of the first row of the diagram. Note also that the bumping 
route is contained in the Young diagram of the tableau T„; this ran- 
dom Young diagram, whose distribution is known as the Plancherel 
measure of order n, converges to a well-known limit shape discov- 
ered in the celebrated works of Logan-Shepp |LS77j and Vershik-Kerov 



|VK77t IVK85j . Figure 2a shows the bumping routes 6^^^ q, for various 
values of a in a numerical simulation with n = 10^. As it turns out, 
the bumping routes converge after scaling to a family of deterministic 



limiting curves, which are shown in Figure 2b The precise result is as 
follows. 
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Figure 2. (a) Bumping routes Bj;^_^ ^^ in a simulation 
with n =W and the values a = 1/10, 2/10, . . . , 9/10; 
(b)|the limiting curves (/3q,(s), s) for the same values of 



a. The region bounding the limit shapes is the Logan- 
Shepp-Vershik-Kerov limit shape of Plancherel-random 
Young diagrams |Roml3j . 



Theorem 1 (Limit shapes of bumping routes). Define the functions 

r , \L, Ua, Va, Xq,, tja) ^ "?/ 



(1) r2(u) = — I usm 

n 



(2) F(u) 



1 1 fuVA^ 



w 



(3) u^{t) = ViF-^ ( 

(4) v^{t) = Vin(^F-^ (^ 

(5) Xait) 



2 TT V 4 

a 

1 
a 

J 

Va{t) +Uait) 



(6) Vait) 



Vait) - U^{t) 



+ sm 



-1 



u 



(7) K(a) = y^{l) 



n{F~\a))~F 



-1, 



a] 



(l«l<2), 

(l«l<2), 
(0 < a < t < 1), 
(0 < a < t < 1), 

(0 <a < t < 1), 

(0 < a < t < 1), 

(0< a< 1). 
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Define a one-parameter family {(^a(t))„^^^ of functions, where for 
each a G [0, 1), /3a(-) is given by 

(8) (3a{s) = x^iy-\s)) (0<s </£(«)). 

Then Pa{') describes the limiting bumping route Br„_^^Q,, in the follow- 
ing precise sense: for any < a < 1 and e > 0, we have that 



(9) P 



k 



T„—,,a 



K[a] 



n 



> e 



(10) P ( max 

^l<m<fcT„_i,Q 



^T„~i,a 



m] 



n 



-l^a 



-> 0, and 



m 



A Kla, 



n 



> e 



^0. 



2. Preliminary remarks 

As a first step towards proving Theorem [T| let us recall some facts 
from the theory of Plancherel measure, which will immediately prove 
^ and also help elucidate the somewhat involved definition of the fam- 
ily of limiting curves Pa{-)- First, when discussing Plancherel-random 
Young diagrams and Young tableaux it is convenient to use the so- 
called rotated (also known as "Russian" ) coordinate system, related to 
the standard x-y coordinates by the linear change of variables 



(11) 



u = X — y, 
V = X + y. 



In this coordinate system, the curve v = Q{u), where Q{u) is de- 
fined in ([I|, describes the Logan-Shepp-Vershik-Kerov limit shape of 
Plancherel-random Young diagrams mentioned in the introduction. 

Second, the function F{u) defined in ^ is the cumulative distri- 
bution function of the semicircle distribution on [—2,2]; that is, we 

have 

1 



F(u) 



2n 



V4: - s2 ds (|m|<2). 



-2 



Its importance for the present discussi on is t hat, according to one of 
the main results of our previous paper JRSllj . the point 



(12) 



{U{a),V{a)) = {F-\a)MF-\a))) 



is the limiting scaled position (in rotated coordinates) of the new 
box added to the Robinson-Schensted shape after applying the in- 
sertion procedure with the number a G [0, 1] to the existing inser- 
tion tableau T„_i. Mo re pre cisely, when stated using our current 
terminology, the result RSll[ Theorem 5.1], which we dubbed the 
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"asymptotic determinism of RSK insertion," says that the last 
position (6a(/cT„_i,„),fcr„_i,„) of the bumping route Br„_i,« satisfies 






(f/(«),V(«)). 



After applying the inverse transformation of (11) to rewrite the result 
in x-y coordinates, and noting that 

V{a) - U{a) 

«^(«) = Vail) = 2 ' 

we get the relation (pi) , the first claim of Theorem QJ 

Next, turn to the pair of functions {ua{t),Va{t)) defined in ([s])- 
(|4]), with the associated pair {xa{t),ya{'t)) from ([5])-(|6]) representing 
the same functions in x-y coordinates. Note that for any fixed a, 
the planar curve (xait) , yait)) ^^^^^ is a reparametrized version of the 



curve (/3a (s), ■^jg^^^^. „ which according to our claim (10) is the limit 

shape of the bumping route BT„-i,a (one needs to note that ya{-) is a 
strictly decreasing function; see Lemma [2] below). It turns out that the 
parametrization of the curve as (^Xa{t),y ait)) is the correct one when 
trying to prove the limit shape result (although the parametrization 
(/3q,(s),s) is the one that answers the original question). To see why, 
we need to explain the role of the parameter t. Note that for fixed 
t, the points {uait),Va{t))^^^^^ all lie on the curve v = \/ifl{u/\/i), 

which is a copy of the limit shape v = Q{u) scaled down by the fac- 
tor y/i. The idea is that this scaled-down copy represents the limiting 
shape of the "t-sublevel tableau" of T„_i — that is, the subset of boxes 
of T„_i containing an entry < t. We will show below that the point 
{uait),Vait)) (or (xQ,(t), ?/a(t)), in the usual coordinate system) corre- 
sponds to the limiting position, after scaling, of the point at which the 
bumping route B^^^ „ exits this sublevel tableau. The reason for this 
is that this exit position relates to the sublevel tableau in roughly the 
same way that the final bumping route position (&a(/cT„_i„), ^t„_i „) 
relates to the entire tableau T„_i, except that there is a scaling rela- 
tion that causes the number being inserted to change from a to a/t. A 
more precise formulation of this statement is discussed in the next sec- 
tion, after which we will see that our main result follows without much 
difficulty by another appeal to the "asymptotic determinism of RSK" 
theorem. A schematic illustration of the argument described above is 
shown in Figure [3j 
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K{a) 



s=ya{t) 




l3a{s)=Xa{t) 



Figure 3. The meaning of the parameter t: for a fixed 
value of a, the intersection ('Ua(t),fa(t)) of the as- 
ymptotic bumping curve with the scaled-down copy 
V = \/tVt{u/\/t) of the Logan-Shepp-Vershik-Kerov limit 
shape is computed by applying the asymptotic determin- 
ism theorem to the t-sublevel tableau; when the numbers 
in the sublevel tableau are scaled to the range [0, 1], the 
number a being inserted is scaled to a/t. 



We conclude this section with another small but useful observation. 

Lemma 2. The function ya{-) is strictly decreasing. In particular, the 
limit shape functions (3a{-) defined in (IS]) are well-defined. 

Proof. Denote 

{X{a),Y{a)) = ^{V{a) + U{a),V{a)-U{a)), 



where U{a),V{a) are defined in (12). First, note that Y{-) is strictly 



decreasing, since it is the composition of the increasing function a i— ;■ 
F~^{a) with the function u i-)- ^{fl{u) —u), the latter being decreasing 
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(this can be seen by interpreting this function geometrically, or by 
differentiating and using the fact that Q'{u) = - sin~^(M/2)). Now, if 
t < t' are numbers in [a, 1] then 



y^{t) = VtY{a/t) < Vt'Y{a/t) < ^fb'Y{a/t') = y^{t'), 
proving the claim. D 

3. Plancherel measure and sublevel tableaux 

Recall that the distribution of the insertion tableau P„ associated 
via the Robinson-Schensted correspondence with a uniformly random 
permutation a^ in ©„ is also called the Plancherel measure of order n 
(this refers to the measure on standard Young tableaux; the distribu- 
tion of the shape of the tableau is the Plancherel measure on Young 
diagrams of order n that was mentioned in the introduction). By the 
remarks made in the introduction, a tableau P„ with this distribution 
arises by taking the tableau T„ and "standardizing" it by replacing 
each entry Xk with the ranking of Xk in the list Xi, . . . , X„ (i.e., the 
number a{k) such that X^ = X^^K, where X|" < . . . < Xn are the 
order statistics of Xi, . . . , X„). Note also that the vector of order sta- 
tistics is independent of the tableau P„ and is distributed uniformly on 
the simplex 

A„ = {{xi, . . .,Xn) ■■ < xi < . . . < a;„ < 1}. 

It follows that, conversely, if we start with a random standard Young 
tableau P„ distributed according to the Plancherel measure of order n 
and a random vector {Wi, . . . , Wn) that is independent of P„ and is 
distributed uniformly on A„, then the tableau T^ obtained by replacing 
each entry p of P„ by Wp is equal in distribution to T^. 

We will now apply these observations to prove a simple lemma about 
sublevel tableaux. As mentioned above, for any < t < 1, the t- 
sublevel tableau of T„_i, which we denote by T!^1i, i^ ^^^ subtableau 
of T„_i consisting of those boxes with entries < t. An essential fact 
that makes our main result possible is a self-similarity property that 
says that T.l\ is distributed roughly as a scaled version of T^ for k ~ 

tn. Since the number of entries in T^_i is itself random, we need to 
condition on its value to make this statement precise. The details are 
as follows. 

Lemma 3. Let < t < 1 and < k < n — 1. 
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The number \T^li\ of boxes in T!^_i satisfies 

(t) I 



(14) ^^ -^ t. 

n — 1 n->oo 

• Conditioned on the event \T^ili\ = k, the rescaled sublevel tableau 
\Tn-i (where the rescaling means that each entry ofT^l^ is di- 
vided by t) is equal in distribution to T^. 

Proof. Recall that the entries of T„_i are the i.i.d. uniform random 
numbers Xi, . . . ,X„_i, so the entries of the sublevel tableau T^_i con- 
sist of the subset of the numbers Xi, . . . , X„_i which are < t. It follows 
that the random variable Zn = |T^_i| is given by 

rt— 1 
Zn = 2_,l{X,<t}) 

SO the first claim follows from the law of large numbers. 

Denote by (Yi, . . . , Yz„) the sequence (of random length Zn) of those 
X/s (for 1 < j < n — 1) for which Xj < t, in the order in which they 
appear. From elementary probability theory, it is easy to see that, 
conditioned on the event {Zn = k}, the random variables Yi, . . . ,Yk 
are independent and uniformly distributed in [0,t]. But now observe 
that (still conditioning on {Z„ = k}) the sublevel tableau T^li can be 
constructed as follows: 

Standardize: 

Replace T„_i by a standard Young tableau P„_i, where each 

entry of Pn-i is the ranking of the corresponding entry of T„_i 

in the list Xi, . . . , X„_i. 
Sublevel: 

Take the A;-sublevel tableau -P„_i of Pn-i- 
Destandardize: 

Replace each entry p of P„„i by the pth order statistic ly of 

the sequence Yi, . . . , Y^. 

By the remarks made at the beginning of this section, P„_i is a Plancherel- 
random standard Young tableau of order n— 1. A well-known property 
of the Plancherel measures is that they are a consistent family of proba- 
bility measures, in the sense that for any k < m, the /c-sublevel tableau 
of a Plancherel-random tableau of order m is a Plancherel-random 
tableau of order k; this follows, for example, from the observation 
that the recording tableau associated via the Robinson-Schensted cor- 
respondence to a sequence of length m (respectively, k) of i.i.d. f/[0, 1] 



10 
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random variables is a Plancherel-randoni standard Young tableau of 
order m (respectively, k). So, the tableau -P„_i is distributed according 
to the Plancherel measure of order k. Finally, since (still condition- 
ing on the event {Z„ = k} as before) the vector of order statistics 
{yI''\ . . . , Y^''^) = (X{"\ . . . , Xf ^) is independent of P„_i (and hence 
also of Pn-i) and is distributed like t times a uniformly random vector 

in Afc, again by the remarks made above we have that T^li is (condi- 
tionally on {Zn = k}) equal in distribution to t ■ T^. D 



4. Finishing the proof 



To prove (10), we first reparametrize the bumping route BT„_i,a ac- 



cording to the parameter t associated with the sublevel tableaux. For 
each < a < 1, this reparametrized bumping route will now be a 
random function ^n,a '■ [a, 1] — )■ N x N defined by 



(15) 



^n,a(t) = (&T„_i,a("^),"^) 



where for each t we denote by m the minimal number for which 
(bTn-i,a{fTT'),fTT') hes outsidc the sublevel tableau T^li- Note that al- 
most surely we have that 

$n,a(a)=(&T„_i,a(l),l), 

*n,a(l) =(&T„_i,a(A;r„_i,a),fcT„_i,a), 

and the range of $„_„ consists of the entire bumping route BTn-i,a- 
Theorem 4. For any < a < 1 and e > we have 



(16) P ( max 



^n,a\'^ ) 



n 



{Xait),yait)) 



> e 



-^0. 



Proof. Fix a G [0,1). First, we prove the weaker statement that for 
any e > and t G [a, 1] we have 



(17) 



P 



*n,a(t) 



n 



- {Xait),yait)) 



> e 



^0. 
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11 



Denote Z„ 



-n-l\ 



|T!;"M as before, and let 5 > be some small number 



(depending on e) whose value will be fixed shortly. We have 



P 
< P 



- {Xa{t),ya{t)) 



> e 



n — 1 



>6 



< P 



--t\<s 



n — 1 



$n,a(t) 



n 



- {Xait),yait)) 



> e 



>6 



+ max P 



~t\<5 



^n,ay^ ) 



- {Xait),yait)) 



> e 



In the last expression, the first term tends to as n ^ oo, by (14). 
Let k = k{n) be the value for which the maximum of the second term 
is attained. Note that the second claim of Lemma [3] implies that the 
conditional probability in the second term can be replaced by its un- 
conditional counterpart 



(19) P 



$fc+i,„/i(l) 



- {Xait),yait)) 



> e 



PI xl^^^^^^-V~t{x^/,il),y^/til)) >e 

If we had the precise equality k = tn, it would immediately follow from 
(13) that this probability tends to as n (and therefore also k) tends 
to oo. As it is, such an equality does not hold; however, we restricted 
fc to a range such that 

k k 

t — 6 < lim inf — < lim sup — < t + S. 

n-s>oo n n-5.00 n 

This is good enough, since it is easy to check that if 6 is taken (as a 
function of e) to be a small enough positive number, then the right- 
hand side of (flQJ) can be bounded from above by 



(20) P 



$ 



k+l,a/t 



(1) 



Vk 



{Xa/t{l),ya/t{i)) 

+ P 



> 



$ 



k+l,a/t 



(1) 



Vk 



>3v^ 
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The first probability tends to as n ^^ oo by (13). The second prob- 



ability is bounded by the probability that a Plancherel-random Young 
diagram of order k has a row or column of length > Syk; it is well- 
known that this probability decreases to at a rate that is exponential 
in yk (see |Roml3t Lemma 1.5]). Thus, combining these observations 
with (pl, (jigl) and the bound (pOl) proves ^. 



Finally, to finish the proof we need to show that (17) implies (16). 



This is a standard argument: first, (17) clearly implies a version of 



(16) in which the maximum is taken over finitely many values a < 
ti < . . . < tp < 1 oi t. Second, since both the functions t h-)- ^n,ait) 
and t i-T- (^Xait),yait)) have the property that their x-coordinate is 
weakly decreasing and their y-coordinate is weakly increasing, and since 
t I-)- [xa{t),ya{t)) is continuous, knowing that the bound 



n 



{Xait),yait)) 



> e 



holds for all values of t in a finite set that is sufficiently dense in [a, 1] 
ensures that the same inequality (with e replaced by, say, 2e) will hold 
for all t G [a, 1]. The details are easy and are left to the reader. D 

Proof of ([Io|). It is now easy to derive (10) from (16). The idea is that 



the relation between t and m in (15) can be inverted, expressing the 
mth point (&T„_i,«(^7i),r7i) of the bumping route as $„_Q,(t(m)) where 



t{m) is the minimal value t > a for which the y-coordinate of ^n,ait) is 
equal to m. (Note that t(-) also depends on n and a, but for convenience 
we leave this dependence implicit in our notation.) Expressing ( [l6) ) in 
terms of t(r7i) gives the convergence in probability 



max 

l<m<fcr 



(&T„_i,a(rn),m) 



n 



{xa{t{m)),ya{t{m))) 



-^0, 



which can be broken down into two separate convergence relations. 



(21) 



(22) 



max 

l<m<A:y 



max 



ya{t{m)) 



m 



-^0, 



l<m<kT 



-i,a 



m] 



n 



x^{t{m)) 



-^0. 



Now observe that z i— )■ y^^{z A /«(«)) is a continuous function on 



[0,2fi;(a)]. Combining this with (21) and the fact that (by (|£ 
kT„-i,a/ y/n < 2K{a) with asymptotically high probability, we see that 



max 

l<m<fej' 



771 



/ \ — 1 / / \ 



->0. 
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Finally, this relation, together with (22) and the fact that Xa{-) is 
continuous, implies that 



max 

l<m<kT 1 






n \ \ \/n 



-^0, 



which is exactly (10). D 
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